Remote DevOps Jobs · Distributed Systems

Job listings

  • Architect and run mission-critical operations for Pyth's Price Feeds.
  • Ensure feeds operate flawlessly 24/7.
  • Lead data forensics to quantify impact and drive solutions that measurably reduce recurrence.

Pyth provides real-time, distributed market data infrastructure. We are hiring someone to improve the service management of Pyth's Price Feeds.

US Unlimited PTO

  • Invent new managed compute primitives that feel first-class in Temporal Cloud.
  • Design self-optimizing autoscaling systems that scale worker fleets safely and predictably.
  • Architect, build, and operate services on the hot path of task execution where performance and correctness are customer-visible.

Temporal provides an open-source programming model that simplifies code and enhances application reliability. It aims to be the reliable foundation of every developer’s toolbox. Temporal is a growing company that values curiosity, drive, collaboration, authenticity, and humility.

  • Own and operate GPU and accelerator clusters for AI training, inference, and experimentation, ensuring reliability and cost-efficiency.
  • Build and optimize scheduling, orchestration, and serving systems using frameworks like vLLM and Triton to improve latency, throughput, and memory efficiency.
  • Partner with ML engineers to remove workflow bottlenecks and build observability for GPU utilization, capacity, and incident response.

Kraken is a crypto exchange platform building premium financial products for traders and institutions, accelerating global crypto adoption. It is a mission-driven, fully remote company with a world-class team of crypto experts spread across more than 70 countries.

  • Infrastructure & scale testing: Build and maintain Docker/Kubernetes harnesses for repeatable stress and scale testing as a pre-release gate, while managing multi-cloud infrastructure with reliability and cost targets.
  • Release engineering & CI/CD: Consolidate CI/CD into reusable workflows, owning version tagging, hotfix paths, PR review scaffolding, and the full release lifecycle.
  • Observability & security: Implement a single pane for metrics, traces, and logs powered by OpenTelemetry, while managing security workflows, vulnerability triage, and incident response.

Sei Labs builds open-source technology for the high-performance Sei Blockchain, a parallelized EVM Layer 1 designed to scale with the industry. The team comprises veterans from major tech and finance firms and is dedicated to onboarding the next billion users to Web3.

US 3w PTO

  • Manage and improve CI/CD pipelines (GitHub Actions, Terraform, HashiCorp stack).
  • Orchestrate compute workloads across cloud and bare-metal GPU clusters and design systems for secure deployment.
  • Build internal tooling to accelerate developer workflows and monitor system performance under high computational loads.

Atomic Industries is reinventing how the world makes things by developing an AI-driven platform to accelerate the production of manufacturing tools and molds. The company runs a fully operational factory in Detroit, combines industrial expertise with Silicon Valley innovation, and is backed by top-tier investors.

  • Design and operate highly scalable, fault-tolerant systems that support production workloads across a distributed cloud environment.
  • Build and improve observability systems to provide deep visibility into system behavior and performance.
  • Partner with engineering teams to design systems with reliability and scalability built in from the start.

Fieldguide is establishing a new state of trust for global commerce and capital markets by automating and streamlining the work of assurance and audit practitioners, specifically in cybersecurity, privacy, and financial audit. It is a remote-first, values-driven company backed by top investors, building an inclusive and supportive team to create the future of audit and advisory software.